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We construct new Ginsparg- Wilson fermions for QCD by inserting an ap- 
proximately chiral Dirac operator — which involves ingredients of a perfect 
action — into the overlap formula. This accelerates the convergence of the 
overlap Dirac operator by a factor of 5 compared to the standard construc- 
tion, which inserts the Wilson fermion as a point of departure. Taking into 
account the effort for treating the improved fermion, we are left with an total 
computational overhead of about a factor 3. This remaining factor is likely 
to be compensated by other virtues; here we show that the level of locality is 
clearly improved, so that the exponent of the correlation decay is doubled. We 
also show that approximate rotation invariance is drastically improved, but a 
careful scaling test has to be postponed. 
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1 Introduction 



Over the recent years, substantial progress has been achieved in the long- 
standing problem of constructing a formulation of chiral fermions on the 
lattice. It turned out that there is a particularly harmless way to break the 
full chirality of the the lattice Dirac operator, so that the physical properties 
related to chirality are still represented correctly. This breaking term is suf- 
ficient to circumvent the Nielsen-Ninomiya No-Go theorem |IJ, which refers 
to full chirality, in the sense that the lattice Dirac operator D anti-commutes 
with 75. This harmless breaking is characterized by the Ginsparg- Wilson 
relation (GWR) § 

{D, 7s } x>v = 2(D'y 5 RD) x , y , (1) 

where the kernel R has to be local and it must not anti-commute with 75. 
The standard choice for this term reads 

Ky = YJ x > y ' ^ > ° ' (2) 

The general class of solutions to the GWR with the standard kernel R st 
is given by the so-called overlap formula || 

A ov + /i 

hA /\/aIa 

Do- fx. (3) 

For Dq one may insert some sensible lattice Dirac operator. It has to be local 
(in the sense that the correlations decay exponentially) and free of fermion 
doubling, but we do not require any form of chirality for D . The resulting 
operator A is then transformed into an operator A ov , so that j L A ov is unitary. 
This provides the overlap Dirac operator D ov , which solves the GWR. The 
choice of the mass parameter \x is constrained; inside its allowed interval it 
can be optimized with respect to certain criteria, see below. 

From eq. (|3|) it is obvious that the spectrum of a GW fermion — with 
the standard kernel given in eq. @ — is always situated on a circle in the 
complex plane with center and radius \x\ we denote it as the GW circle. This 
condition for the spectrum is equivalent to the GWR with kernel R st . 

In almost all the literature, the Wilson operator D\y was inserted as Dq 
into the overlap formula, without consideration of other options. We denote 



D v 
A 
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the resulting particular overlap fermion as the Neuberger fermion. As any 
Ginsparg- Wilson fermion (GW fermion), it has a chiral symmetry which is 
lattice modified but exact ||. Since the corresponding transformation is 
local — based on the locality of R — the chiral limit is not plagued by mass 
renormalization. 

The generalization of the overlap fermions to a whole class of solutions 
of the GWR — and the motivation for considering alternative operators 
D 7^ Dw - - were given in Ref. 0. In fact, there are further issues of 
importance for a lattice fermion formulation, such as the quality of the scaling 
behavior, the level of locality and of approximate rotation invariance. With 
these respects, the Neuberger fermion is not quite satisfactory, even though 
it is local up to moderate coupling strength JJ. All these properties depend 
on the choice of D , hence it is motivated to search for a better starting 
operator than D w . 

Moreover, the practical evaluation of D ov is a formidable numeric task. 
In particular, the quenched simulation of the Neuberger fermion is about as 
expensive as the simulation of dynamical Wilson fermions |7]], i.e. the com- 
putational overhead amounts to about two orders of magnitude. Also with 
this respect a better choice of D can help. Our guide-line is the following 
observation: if we insert a GW fermion (with R st ) as -Do, then it is simply 
reproduced by the overlap formula ([3]), D ov = D . [] Therefore, in practice 
we try to insert an approximate GW fermion as -Do 0) so that it is altered 
only a little by the overlap formula, 

D ov « D . (4) 

This small alteration corresponds to a chiral correction (in the sense of the 
GWR). Q 

Thus the first issue is to construct an approximate GW fermion "by 
hand". At this point, we recall that also perfect and classically per- 
fect fermions solve the GWR. Their construction is very tedious, but in 
the present context we can use a relatively simple approximation, namely 
a hypercube fermion (HF). The mass renormalization is still quite strong 

1 Of course, this property holds for any fixed GW kernel R, if one uses the suitably 
generalized overlap formula pj . 

2 Similarly, one could also insert an approximately chiral Dq in the 4d space of domain 
wall fermions JB], ||] . 
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for simple HFs |fT0| , IT|, which is a problem in their direct application [|T^ 
However, here we reach the chiral limit by inserting the HF into the overlap 
formula. 

Of course, the (classically) perfect fermion has further virtues in addi- 
tion to chirality. In particular its scaling behavior is (almost) free of lattice 
artifacts, i.e. dimensionless ratios of physical observables are (almost) inde- 
pendent of the lattice spacing. Moreover the observables are also rotational 
invariant for perfect fermions, and rotation symmetry is approximated very 
well for the truncated perfect hypercube fermions, as the pion dispersion 



shows | ID| . Since the overlap formula modifies the HF just a little — see 
relation @ — we expect the good scaling and rotation behavior to persist 
under the chiral correction. Then we obtain an improved overlap fermion. 

There is yet another virtue to be expected based on relation (f|): the 
hypercube fermion is short-ranged — its free couplings are restricted to a 
unit hypercube on the lattice — hence we also expect a high level of locality 
for the overlap-HF (given by D ov if we insert D = Dhf)- Due to the modest 
alteration, the long range couplings can be turned on just slightly, hence their 
exponential decay will be fast. The Wilson fermion is also short-ranged, but 
it changes a lot in the transition to the Neuberger fermion, so we do not 
have the above reason to expect a good locality. Indeed, even for the free 
Neuberger fermion the decay of couplings (in the separation of fermion and 
anti- fermion) is a rather slow exponential [||]. 

All these expected virtues of the overlap-HF have been tested and im- 



pressively confirmed in the 2-flavor Schwinger model [13|. Now we carry on 
this program to QCD. In Section 2, we first describe the construction of a 
suitable HF in d — 4, and we illustrate its proximity to a GW fermion by 
evaluating typical fermionic spectra on small lattices. In Section 3 we discuss 
the polynomial evaluation of the inverse square root in eq. (|3|). In Section 
4 we investigate the speed of the numeric evaluation of the overlap formula. 
We show that for our HF a polynomial of a low order is sufficient to approx- 
imate the sign function or the inverse square root to a high accuracy. For 
the Neuberger fermion the same accuracy requires a polynomial of a much 
higher degree. Section 5 gives results for the condition numbers and their 
impact on the convergence rate, now proceeding to larger lattices. Section 6 
presents a comparative study of the level of locality, i.e. of the exponential 
decay of the maximal correlation at large distances, and Section 7 compares 
the violations of rotation invariance. Section 8 contains a summary and our 
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conclusions. 



2 Approximate Ginsparg- Wilson fermions in 
QCD 

For free or perturbatively interacting fermions, perfect actions can be con- 
structed analytically fl3| . In the case of non-perturbative interactions, this 



is possible only numerically and only in the classical approximation, f] The 
renormalization group transformation leading to the (quantum) perfect ac- 
tion would involve effectively a continuum functional integral. Still its exis- 
tence is of conceptual interest. It implies for instance that also a topological 
lattice charge without lattice artifacts exists |]IE|| , and that supersymmetry 



can in principle be represented continuously on the lattice | P7fl . Similarly, 
the perturbatively perfect action shows that e.g. the axial anomaly is re- 
produced correctly ]18|| , but only a modest improvement persists on the non- 



perturbative level 0, . What is (in principle) accessible and promising for 
simulations is an approximation to the classically perfect action ||20| , which 
works well for fermions in two dimensions ||21|| . However, the difficult con- 



struction and application in d = 4 is still under investigation [T2\. In that 
case, there is also a potential for an excellent scaling, but in order to obtain 
a sufficient chiral quality, it seems that the group working on it (the Bern 
collaboration) also depends on the concept of Refs. 0, [13]] (chiral correction). 
In the free case, the truncated perfect HF has still an excellent scaling 



behavior [K], |23j , if the renormalization group parameters are optimized for 
locality. Hence we use the free HF as the point of departure in our attempt 
to construct a HF, which is an approximate GW fermion, and which is also 
promising with respect to scaling and approximate rotation invariance. A few 
elements for its gauging are then added such that the GWR is violated only 
modestly at the coupling strength of interest. In this Section, we are going 
to describe this construction step by step. A synopsis of this construction 



has been anticipated in Refs. p4| 



The concept formulated in Refs. || [L3| has also been adapted in Ref. 
[25|j , where some progress is reported, although a very simple ("planar") 
operator D Q was used, which is quite far from a GW fermion even in the 



3 For recent work on perfect actions for semi-classical effective actions, see Ref. (TO 
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free case. With this "planar overlap operator" some results for the finite 
size scaling of the chiral condensate of Ref. [26] were reproduced, and the 
effects of instantons in the chiral symmetry breaking were reconsidered P7] ]. 



Other very simple non-standard operators D were used in Refs. [[28], 
In contrast, a very complicated approximate Ginsparg- Wilson fermion was 
constructed in Ref. |3(J by introducing many parameters and tuning them 
for a minimal GWR violation at a specific value of (3. This corresponds to 
the first part of our program (constructing an approximate GW fermion by 
hand), but since the overlap formula is not used, chirality is still not exact. 
Moreover, there are no ingredients in favor of improving other properties. 
However, that work reports some gain if a specific improved gauge action is 
used. Different improved gauge actions were applied in Refs. |22|, [Hj]. The 
use of improved gauge actions is also on our agenda, but in the present work 
we always use the standard plaquette gauge action. This allows us to observe 
unambiguously the progress due to the improved Dirac operator. 

All the considerations below are based on quenched configurations on pe- 
riodic lattices of sizes 4 4 , 8 4 (Sections 2, 3, 4) and 12 4 (Sections 5, 6 and 7), 
and beyond Section 2 we always use (3 — 6. 

Step 1: Minimal gauging 

As we mentioned above, we start from the couplings of the free perfect 
fermion. It is optimized for locality, and then truncated to a HF by imposing 
periodic boundary conditions. We write the resulting Dirac operator as 

D(x - y) = p^x - y)j„ + X(x - y) , (x, y G ^ 4 , \x v - y v \ < 1) (5) 

and the couplings of the vector term p^ and the scalar term A are given 
in Ref. Table 1. Note that the components of x — y are restricted to 



— 1,0, 1, and that p M is odd in the //-direction and even otherwise, while A 
is entirely even. In the free case, this HF is an excellent approximation to a 
Ginsparg- Wilson fermion ||. 

Our first HF for QCD, with a Dirac operator of the form 

D(x,y,U) = p fl (x,y,U)j fJ , + X(x,y,U) , (U : compact gauge field) (6) 

is now obtained by "minimal gauging" of the free HF: the sites, which are 
coupled in the HF formulation, are connected by shortest lattice path only. 
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The gauging is done by simply attaching the free coupling to these paths, 
divided into equal parts where several shortest paths exist. 

This simple prescription already provides an excellent pion dispersion 
relation flOf . On the other hand, with such a gauging the HF suffers from 
a strong additive mass renormalization |I| p] , comparable to the Wilson 
fermion. We illustrate this at strong coupling in Fig. p]: it shows the full Dirac 
operator spectra for the Wilson fermion and the HF for a typical configuration 
at (3 = 5 on a 4 4 lattice. 




Figure 1: The fermionic spectrum for the minimally gauged HF and for the 
Wilson fermion, in a typical configuration at (3 = 5 on a 4 4 lattice. 



In such a rough gauge background, the overlap formula is only applicable 
if we have an excellent GW approximation to be inserted as D . Fig. 1 
shows that the standard Neuberger fermion, D = Dw, certainly fails, and 



the minimally gauged HF cannot handle such a strong coupling either [32 



4 Note that in such a situation it could be misguiding to consider only the spectrum of 
AqAq, as it is sometimes done in the literature. For the HF in Fig. |l|, that spectrum alone 
would look quite satisfactory, even though the physically crucial left arc is missing. 
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Since we assume the kernel R to be of the form (|2|), the overlap formula 
performs a sort of projection of the eigenvalues onto the GW circle. This 
projection tends to be close to radial [Tj|. Hence we need a statistically 
safe distinction between a left arc and a right arc of the spectrum of D to 
start with, so that it can be mapped onto a GW circle in a reliable way. 
Here we focus on the regime close to the real axis. Eigenvalues have to 
cross the center as one changes topological sectors — since we define the 
topological charge by the index theorem Q — but they have to do so quickly 
as the deformation proceeds, so that the distinction is clear for the typical 
configurations at the coupling strength under consideration. 

However, in the present case the freedom of choosing this center does still 
not help: wherever we choose /i, it will frequently happen that too many 
zeros occur (due to too many mappings to the left), so that the doubling 
problem is back, or that too many eigenvalues are mapped to the right-hand 
side, so that mass renormalization re-appears. Also further generalizations 
of R (beyond the restriction to one site) do not help in this situation. 

A question of interest is at which coupling strength the applicability of 
the overlap formula with simple operators like Dhf sets in. Typical spectra 
on a 4 4 lattice suggest that for instance at ft — 5.4 the coupling is still too 
strong, but at ft > 5.6 we are about to approach to safer grounds, see Fig. 0. 
The same is true for the Wilson fermion (see first Ref. in |[24|| , Fig. 8). On 
larger lattices the minimal ft is still likely to rise to about ft > 5.7. 



Step 2: Critical link amplification 



Worried about the strong mass renormalization, we first want to move 
our HF towards the chiral limit. We do so by amplifying each link variable 
by a factor 1/u, 

U^x) -+ ± , u<l. (7) 

The idea is to compensate the (mean) link suppression due to the gauge field. 

5 This is a sensible definition of the topological charge on the lattice for any Ginsparg- 
Wilson fermion j|. Up to moderate coupling strength, it is often close to the geometrical 
charge |j4j. In general it also tends to agree with the charge identified from cooling the 
SU(N) gauge configurations, especially at increasing N |35). In the case of classically 
perfect fermions it corresponds to the classically perfect charge H, and analogously for 
(quantum) perfect fermions. 



8 




0.5 1 1.5 2 2.5 0.5 1 1.5 2 2.5 



Figure 2: Typical spectra of the minimally gauged HF at (3 = 5.4 (on the left) 
and at (3 = 5.6 (on the right). In the latter case, the overlap formula seems 
to be applicable with mass parameter \i ~ 1.4. 



This can be viewed as a generalization of the critical hopping parameter 
used for Wilson fermions, but it is also related to the spirit of "tadpole 
improvement" [ ftfifl . 

Fig. |3] illustrates that criticality is reached with u ~ 0.8 at (3 = 6. f] 
We see that this already provides a decent approximation to a GW fermion. 
This is remarkable, because we approach the chiral limit in the most eco- 
nomic way, by staying within the framework of minimal gauging. We did 
not introduce additional lattice paths yet, hence Step 2 does not require any 
computational effort (once the critical value of u is determined). 



Step 3: Fat links 



We now want to improve the chiral quality of our HF further by going 
beyond minimal gauging. As a non-minimal element we introduce fat links. 
For a given configurations, we substitute each link variable according to the 
following scheme 



link — > (1 — a) link + — ^staples 

6 L 



a e IR 



6 The absence of the arc very close to is due to the small size of the lattice; it is not 
due to the choice of the lattice Dirac operator. 
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0.5 1 1.5 2 2.5 

Figure 3: HF spectra at [3 = 6 on a 4 4 lattice at critical link amplification 
(u = 0.8) for three configurations. 

before evaluating the Dirac operator. This is still an economic tool. The 
substituted link variable on the right-hand side is not mapped back onto the 



gauge group, in contrast to the APE blocking [38], and we do not iterate the 
substitution (||). 

As a first observation, we note that the mass renormalization is enhanced 
for increasing a. This may seem counter- intuitive if one imagines that a 
positive a makes the configurations "appear smoother". However, a more 
precise picture confirms this observation: the only coupling of range is 
A(0, 0,0,0) = 1.853... The rest of the scalar term is negative, and in the 
free case we have I] x . A(x) = 0. The gauge field now suppresses the nega- 
tive contributions, whereas A(0, 0, 0, 0) remains unchanged, so that a positive 
mass sets in (if we keep u — 1). This is already true for minimal gauging, 
but if we add fat links with a > 0, this suppression of the negative part gets 
even stronger. A fraction of the negative couplings is now attached to staples 
instead of single links, and the staple suppression corresponds to the third 
power of the mean link suppression. 

If we wanted to use fat links to move closer to the chiral limit, we had to 
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take a < 0. Then the critical value of u rises, and for some strongly negative 
a (far below —1) it even arrives at 1, so that criticality could, in principle, 
be realized solely by means of fat links. However, in this case the rest of the 
spectrum is very far from a GW circle — in particular the upper and lower 
arc are far outside the GW circle — so we do not recommend negative values 
of a. Fig. [| (on the left) shows this effect, and we also see that positive a 
are more adequate to make the shape of the spectrum circle-like. So what 





Figure 4: The effect due to the variation of staple coefficient a in the uncrit- 
ical HF (on the left). On the right: the spectrum of the critical HF with and 
without fat links (everything at L — 4, (3 — 6) . 



is really favorable to improve the proximity to a GW fermion is a positive a 
along with an adapted (i.e. decreased) critical value of u. A good choice is 
a = 0.3, which requires u = 0.76 at j3 — 6. This reduces the radial fluctu- 
ation of the eigenvalues — the eigenvalues move closer together — and the 
upper and lower arc move closer to the GW circle. This is shown in Fig. ^ 
(on the right), which compares the spectra with and without fat links for the 
same configuration. 



Step 4: Vector term suppression 



The above picture for the mass renormalization ignores the role of the 
vector term p^x, y, U). In fact, some tests confirmed that it has only a very 
modest influence on the location of the arc on the left-hand side (and also 
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on the right-hand side). This location is essentially determined by the scalar 
term, but the vector term is crucial for the height of the upper and lower 
arc, i.e. the term is responsible for the imaginary part of the spectrum. 

This property will now be used for a further improvement, still without 
extra computational effort. We introduce different link amplification factors 
for the vector term and the scalar term, since they play a different role. We 
first keep the critical factor 1/u as an overall link amplification, but then 
we multiply a link suppression factor v only in the vector term. So now we 
multiply the links as follows: 

v 

-> - Up in Pn(x,y,U) 
u 

U„ - in X(x,y,U) , (9) 

where u< v < 1. Still the fat links are useful to suppress the radial fluctua- 
tions of the eigenvalues, so we stay with a = 0.3, u = 0.76, and the suitable 
vector term suppression amounts to v = 0.92. Now the upper and lower 
arc also follow the GW circle, and we obtain therefore a very satisfactory 
spectrum, see Fig. [5|. It shows again the spectrum on a 4 4 lattice, but this 
time we also include part of the spectrum of a typical configuration on a 8 4 
lattice. From there we show the 100 eigenvalues with the smallest real parts, 
thus we also visualize how the spectrum "continues" around zero. 



A comment on the clover term 



An obvious candidate for a next step beyond minimal gauging is the 
clover term. We performed a sequence of tests with it being added to the HF 
versions discussed above. We varied the clover coefficient and also considered 
both signs, but from spectra on 4 4 lattices we did not arrive at a clear further 
improvement of the GW approximation in this way. A positive coefficient 
has generally the effect to pull the spectrum closer to the real it was 



observed before for the Wilson fermion |37|, hence the optimal parameter v 



rises a little. A positive clover coefficient does improve the arc around zero - 
which appears on the 8 4 lattice — a little bit, but it distorts the rest. As an 
example, we show the effect of the clover coefficient 0.15 in Fig. |]. One could 
argue that it is precisely the left arc which is physically crucial. However, we 
are going to insert our HF into the overlap formula, so we end up with an 
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Figure 5: The HF on a 4 4 lattice with critical link amplification, fat links and 
a suppression of the vector term. We also show the "continuation" around 
on a 8 4 lattice with the same parameters, i.e. u = 0.76, a = 0.3, v = 0.92. 



exact GW fermion anyhow. In view of the convergence rate in the iterative 
evaluation of this formula, the maximal deviation of the HF spectrum from 
the GW circle matters, so one should not limit the attention to the left arc 
only. 

Indeed, the systematic study of the transition to the overlap fermion on 
larger lattices — which will be presented in Section 5 — shows that a clover 
term with a small coefficient may help a little to speed up the convergence. 
So we are going to include it on the 12 4 lattice (used in Sections 5, 6 and 7), 
since it is computationally cheap anyhow. 
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Figure 6: The effect of the clover term in the HF spectrum. 

3 Polynomial approximations of the overlap 
formula 

In four dimensions, the evaluation of the overlap formula is a notorious nu- 
meric problem. It can only be done by some approximation of the non- 
analytic function involved. In particular, one tries to approximate it by a 
polynomial in the relevant interval. Here we apply Chebyshev polynomials 
for that purpose, as suggested in Ref. M, and we consider two possible ways 
to apply them. For a systematic comparison of various approximation meth- 
ods in the case of the Neuberger fermion, see Ref. . We remark that from 
our experience, different types of suitable polynomials do not lead to a very 
different quality of the approximation (at fixed polynomial degree). 
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3.1 Approximations of the sign function 

We first consider approximations of the sign function, which is done in one 
way or the other in most of the literature. One writes the overlap formula as 

D ov = fill +75^=1 =li [1 +lht{H)\ 

H := 75 (A) - fi) = # f (10) 
and approximates the sign function 

e(x) = { \ X> ° n (11) 

v ' y -l x < v ' 

by a polynomial. As an alternative to the Chebyshev polynomials that we 
are going to use here, also an "optimal rational approximation" has been 



suggested |4(J and it was applied for instance in Refs. 

The eigenvalue distribution of H determines the interval which is relevant 
for the approximation. Since this Section is only meant to illustrate what 
the convergence rate depends on, we consider \x = 1 for simplicity. (Later, 
when we study the convergence rate in detail, optimized mass parameters /i 
will be inserted.) 

In Fig. |7| (on the left) we show the eigenvalue histograms for the cases of 
the Wilson fermion, H\y = 7 5 (Z)^ — 1), and of our preferred HF on small 
lattices (a = 0.3, u = 0.76, v = 0.92), still at (3 = 6, on a 4 4 lattice. We see 
that the spectrum of Hhf is already sharply peaked at ±1, whereas for Hw 
the distribution is very broad. 

In order to make the polynomial approximation directly applicable for 
all eigenvalues, we first have to re-scale the spectra so that they are entirely 



confined to the interval [—1,1], see for instance Ref. ||42|| . The outcome of the 
minimal re-scaling (division by the largest absolute value of an eigenvalue) is 
shown in Fig. |7| (on the right). We see that the spectrum of Hhf is not af- 
fected too much: the peaks are moved to about ±0.7, but there is still a large 
gap around zero. This gap is important, because any polynomial approxima- 
tion of the sign function is plagued by its worst errors near the discontinuity 
at zero (remember for instance the notorious "Gibbs phenomenon" of the 
Fourier expansion). On the other hand, for Hw there is (after re-scaling) 
a considerable eigenvalue density around zero. This shows that it requires 
much more effort to transform the Wilson fermion into an overlap fermion. 
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Figure 7: On the left: the eigenvalue histograms for typical spectra of H HF 
(grey) and H w (black) at (3 — 6. On the right: the same after re-scaling so 
that all eigenvalues have absolute values < 1. 

To demonstrate this prediction with an example, we use a linear com- 
bination of Chebyshev polynomials with maximal degree 21 to approximate 
the sign function in the overlap formula (0). We consider again a typical 
configuration at f3 = 6 on a 4 4 lattice. Fig. [8] shows the resulting spectra if 
we start from the Wilson fermion resp. the HF, and we see that the latter is 
clearly superior. 



3.2 Approximation of the inverse square root 

We now come to a second way to approximate the overlap formula in terms of 
polynomials. The idea is not to consider the sign function any more, but to 
approximate directly the inverse square root in eq. @ . This was first applied 
in Ref. ||. If we start from an approximate GW operator D , then the square 
root is close to 1 (or in general close to /i), and the direct expansion around 
this constant looks attractive. We have tested this method in the Schwinger 
model and we found a very fast convergence if we start from the 2d HF, 



but it slows down if we start from the Wilson fermion [IB]. To get started 
with the polynomial approximation, one re-scales the operator Aq so that 
the spectrum of AqAq is all contained in an interval [S,l] (0 < 5 <C 1). 
The virtue of the function to be expanded, f(x) = l/>/x, is its continuity 
in this interval. However, if S becomes very small we approach a singularity, 
which is a problem similar to the discontinuity of the sign function. Hence 
the question here is how small 5 is going to be. Actually the question is 
the same as in Subsection 4.1 (and also the re-scaling is the same), because 
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Figure 8: The spectra of approximate overlap fermions on a 4 4 lattice, where 
the sign function is approximated by a polynomial of degree 21. We use a 
configuration typical at (3 = 6 and start from Dq = D\y (stars) resp. from 
Dq = Dhf (crosses). 

A\Aq = H 2 . So we can refer again to Fig. [7| where one just has to square the 
eigenvalues. 

Also the comparison of spectra looks very similar to Fig. so we do not 
repeat it but turn to a systematic study of the convergence to an overlap 
fermion. 

4 Convergence rate 

We now look explicitly at the convergence of the overlap formula approx- 
imated by polynomials. [] As a measure for the deviation from the final 
overlap fermion, we measure the maximal deviation of any eigenvalue in the 
spectrum of -D from the unit circle with center 1, {z \ \z — 1| = 1 }. For an 
exact GW fermion — with respect to the standard kernel (0) — the spectrum 

7 The results of this and the following Sections have been summarized before in Rcf. 
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is situated exactly on this circle, as we pointed out in Section 1. 

Fig. |] shows this maximal deviation for a typical configuration on a 4 4 
lattice, as a function of the polynomial degree. We consider both, the expan- 
sion of the sign function as well as the expansion of the inverse square root. 
The converge rate is clearly exponential in the degree of the polynomial. 
We see further that the HF converges much faster than the Wilson fermion. 
For the comparison between the two polynomial expansions one has to take 
into account that the square root expansion refers to H 2 , whereas the sign 
function is expanded in H. Hence the former actually picks up a factor of 2 
in the comparison of the polynomial degree, which makes the sign function 
look favorable, especially for the HF. 



L=4 (all eigenvalues) 




20 40 60 80 100 120 
degree of the Chebyshev polynomials 



Figure 9: The maximal radial deviation from the normalized Ginsparg- Wilson 
circle, evaluated from of the full spectrum on a 4 4 lattice. We compare the 
HF at [A — 1 with the Wilson fermion at \i = 1.6, in both cases expanding the 
inverse square root as well as the sign function. 

Since the volume considered in Fig. |9] is quite small, we now proceed to 
a 8 4 lattice. Here we cannot evaluate the full spectrum any more, but with 
the Arnoldi algorithm we can identify a selected set of eigenvalues. We thus 
evaluated the 100 eigenvalues with the least real part (as shown in Fig. |5] 
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before), because they are physically most relevant, and we measured for this 
subset again the maximal radial deviation. In Fig. we plot this maximal 
deviation, and also the mean deviation, again as a function of the polynomial 
degree. For the Wilson fermion we consider two options: we take either the 
free hopping parameter and optimize the mass parameter to fx = 1.6, or 
we take fx — 1, as in the case of the HF, and insert the critical Wilson 
fermion. The goal is to make sure that we compare the HF really to the best 
application of the Wilson fermion. However, as Fig. [K] shows, the difference 
between the different ways to use the Wilson fermion is tiny. 

L=8 (1 00 eigenvalues with least real parts) 



CD 




20 40 60 80 100 

degree of the Chebyshev polynomials 



Figure 10: The maximal radial deviation and the mean radial deviation from 
the normalized Ginsparg- Wilson circle, evaluated from of the 100 energy 
eigenvalues with the lowest real parts (physical branch) on a lattice volume 
8 4 . We compare the inverse square root expansion for the HF and for the 
Wilson fermion. In the latter case we also compare the case of the free hop- 
ping parameter and a suitable mass parameter of fx = 1.6 with the critical 
Wilson fermion at fx = 1 . They behave very similarly, and the HF converges 
much faster. 

Due to the arbitrary truncation at just 100 eigenvalues the exponential 
behavior is not as clean as in the case of the full spectrum. However, we 
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see that the behavior is very similar, and the improvement of the HF clearly 
persists in the same magnitude. 



To make these observations more quantitative, we discuss as an example 
the sign expansion on the 4 4 lattice, which has a very precise exponential 
behavior. For some configuration typical at (3 = 6 we measured for the 
Wilson fermion the maximal deviation d^ ax (n) = exp(— 0.134n), where n is 
the degree of the Chebyshev polynomial, and for the mean deviation we found 
dyy an (n) = 0.13 exp(— 0.134n), hence the exponential factor is the same. 
For the same configuration, the following HF deviations where obtained: 
d%f(n) = exp(-0.737n), and d^ n (n) = 0.1 exp(-0.737n). 

We mention two ways how to arrive at conclusions from these numbers. 



First, we could fix some degree n which we consider affordable in a 
simulation. The precision of the GW approximation compares as 



"W V") ~ „0.6n 



d^f(n) 



e 0bn . (12) 



For realistic degrees like n — 20 . . . 100 this ratio of the accuracies takes 
a very considerable magnitude. 

On the other hand, we could fix a certain accuracy d max which we 
consider necessary to trust the chiral quality of the approximated GW 
fermion. Then the polynomial degrees, which are required to provide 
this precision, compare as 

HW ~ 5.5 . (13) 



n H F 



This factor may be regarded as the effective gain of the HF due to the 
faster convergence, since the computational effort is essentially propor- 
tional to n. 

The fluctuation of these ratios over different configurations are modest, 
even though d max may vary significantly. For a systematic statistical study 
we now move on to larger lattices. 
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5 Condition numbers 



After the explicit convergence study of the last Section, we now turn our 
attention to the condition numbers of the operators AqAo, which are crucial 
for the convergence rate. This allows us to proceed to larger lattices of size 
12 4 , still at (3 = 6. We first show a history for the condition numbers for 
the HF and the Wilson fermion in Fig. |TT] (on top). Here we adapted the 
parameters so that they are optimal on the larger lattice. For the HF the 
new set of optimal parameters reads 

M = 0.75, w = 0.99, a = 0.87, c = 0.01, /i = 1.3 . (14) 

For the Wilson fermion (with the free hopping parameter), \x = 1.64 turned 
out to be optimal with respect to the condition number, though the depen- 
dence on fi is weak — as it is also the case for locality, see Section 6. Hence 
the condition numbers at \i — 1.4 — which is optimal with respect to locality 
— differ only little from the result in Fig. |ll[ 

We see that the HF condition number is improved typically by one to 
two orders of magnitude. Since it is defined by the ratio of the upper bound 
divided by the lower bound, it is instructive to consider these bounds sep- 
arately. Their histories are shown in Fig. |TT] (below). They reveal that the 
improvement of the condition number is almost entirely an effect due to the 
upper bound. 



In Fig. |1J we illustrate explicitly how the condition number translates 
into an accelerated convergence. As a (somewhat arbitrary) measure for the 
speed of convergence, we consider Chebyshev polynomials at some moderate 
degree, n = 60, and measure the deviation of the function f maiX (r = 24) from 
the exact result (obtained from huge values of n). (The quantity f max (r = 24) 
represents the maximal correlation over the largest distance on our lattice; 
an explicit definition will be given in Section 6.) A polynomial of degree 60 
may be affordable in simulations, and we see that it typically approximates 
/ max (r = 24) already to a high accuracy for the HF, but not for the Wilson 
fermion. For the latter we see that fi = 1.64 is better for the condition num- 
ber than the locality optimal mass parameter of /i = 1.4. 

However, it is important to note that most practical applications of over- 
lap fermions are performed such that the lowest few modes are projected out 
and treated separately. Then the above polynomial evaluation concerns the 
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Figure 11: On top: a history of the condition numbers of AqA at (3 = 6 on 
a 12 4 lattice. Below: the corresponding history of the upper and lower bound 
of the spectra. It shows that the improvement of the condition number of the 
HF is essentially due to the upper bound. 

rest of the spectrum. This method helps a lot, because often very few modes 
are responsible for a slow convergence. However, their separate treatment is 
also tedious, because it requires a very accurate determination of the corre- 
sponding eigenfunctions. Hence the number of modes to be projected out is 
usually not more than about 15; beyond that the spectrum becomes quite 
dense, so projecting out further eigenvalues raises the remaining lower bound 
only very little. 

To do justice to this situation, we introduce "higher condition numbers" 
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Figure 12: The precision of the overlap formula approximated by Chebyshev 
polynomials of the moderate degree of 60. As a measure for the precision, we 
show the accuracy of the function f maiX (r = 24) (c.f. Section 6) for a set of 
configurations at L = 12, (3 = 6. We plot this accuracy against the condition 
number, in order to illustrate their monotonous relation, and the progress of 
the HF over the Wilson fermion. 



Cfc, which are defined by the ratio 

largest eigenvalue 



k th eigenvalue from below 



(15) 



If one projects out 15 eigenvalues, for example, then Cig is relevant for the 



convergence of the polynomial evaluation. In Fig. |13]we show the histories for 
the higher conditions numbers c-i . . . c^o, for the HF and the Wilson fermion. 
This plot confirms that around k — 15 the eigenvalue density is large already, 
and beyond they become even more dense. Hence it is hardly motivated to 
increase k much further. We also see that these histories are much smoother 
compared to C\ (which was shown in Fig. 11), so here we are able to take 
sensible mean values. The results are shown in Fig. [14], and we see that in 



the relevant regime the improvement factor for the HF amounts to about 25. 
The convergence rate behaves like the square root of the relevant condition 
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Figure 13: The history of the "higher condition numbers" C2 ■ ■ ■ C20 (defined 
in eq. (\Th])) for the operator AqA , built from the HF (at fi — 1.3) and from 
the Wilson fermion (at fi = 1.64). 

number. f] Hence we gain a factor of ~ 5. This is amazingly consistent with 
the explicit result obtained on the small lattices (but from the full spectrum) 
in Section 4. 



6 Locality 

Since there is occasionally some confusion about the term "locality" of a 
lattice action, we first clarify that we refer to the definition that all correlators 
decay at least exponentially in the distance. This is the property which is 
crucial for providing a safe continuum limit (since the decay width is fixed 
in lattice units). 

It was conjectured || - - and later proved |^3| - that Ginsparg- Wilson 



fermions cannot be "ultralocal" , not even in the free case. This means that 
their couplings cannot drop to zero beyond a finite number of lattice spacings. 

8 Note that the relevant condition number corresponds to 1/S 2 in the notation of Section 

3. 
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Figure 14: JTie expectation values for the higher condition numbers Ck (k = 
2 . . .20) (defined in eq. (\Th])) for the operator AqAq, built from the HF and 
from the Wilson fermion. In the most popular regime (around k = 15) the 
HF gains a factor of f& 25 over the Wilson fermion. For the latter we also 
confirm that \x = 1.64 is a little better for the condition number than the 
locality optimal parameter \x = 1.4. 

However, locality in the above sense was shown for the free perfect fermion 
TM and for the Neuberger fermion H to hold. In Ref. |5| it was shown that 



the truncated perfect free HF leads to an overlap-HF, which is more local 
than the Neuberger fermion (it has a faster exponential decay). 

In the presence of gauge interactions, locality can be proved for smooth 
configurations, either by assuming a small upper limit for the deviation of any 
plaquette variable from 1 || 45[ , or by assuming that the eigenvalues of A\ A 
do not cluster densely in the vicinity of zero 0. For realistic configurations, 
the exponential decay was observed statistically for the Neuberger fermion 
at /3 = 6 0. This property may collapse at much stronger coupling, but 
at some point the overlap formula is not applicable any more also for other 
reasons, as we discussed in Section 2. The statistical demonstration was 
done by showing that the "maximal correlation" between any two lattice 
sites, separated by a taxi driver distance r, decays exponentially in r. More 
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precisely, the expectation value of the function 



Wr) := Zv { WWW I £ W ~ VmI = r } (16) 

has to decay exponentially, if a unit source is located at the (arbitrary) site 
x. We probed 6 sites x for each configuration, then we went through all sites 
y to determine the maxima. The 6 options for x were sufficient to stabilize 
the function / max ( r )- (Remember that this function was mentioned before in 
Section 5.) 

As we explained in the introduction, the property (|J) suggests that the 
higher degree of locality of the overlap-HF may also persist in the presence 
of gauge interactions. In fact, in the Schwinger model a comparison of the 
function (f max {r)) confirmed this conjecture [[HJ. Here we extend this com- 
parison to QCD on a periodic 12 4 lattice, which is the size that was also 
used in Ref. 0. The use of the taxi driver metrics allows us to proceed to 
maximal distance 24, and the exponential decay is clearly visible. f\ Our 
result is shown in Fig. [T5|. For the Neuberger fermion it agrees well with 
the result of Ref. (which was also obtained at (3 = 6), although we used 
a different mass parameter. We inserted fi = 1.64 which was optimal for 
the condition number of the Neuberger fermion (c.f. Section 5), whereas Ref. 
U used fi = 1.4, which is slightly better for the locality — because it was 
optimized with this respect — but the difference is really small. Also for 
the overlap-HF we used the parameter which is optimal for the condition 
number, in that case fi = 1.3. 

We see that the overlap-HF is by far more local. To be explicit, the 
asymptotic decay of (f max (r)) behaves like 0.017 exp(— 0.45r) for the Wilson 
fermion, and like 0.017 exp(—0.93r) for the HF. This observation suggests 
that the range of applicability of the overlap formula extends up to stronger 
coupling for the overlap-HF. 

Interestingly, the very clean exponential decay sets in only in the presence 
of interactions. For free fermions, the decay is faster, of course, but the 
exponential behavior is not as exact as in Fig. O. 



9 Of course, the decay is also exponential with respect to the Euclidean distance. There 
the decay is less smooth, but the asymptotic behavior can be extracted as 0.04 exp(— \x— y\) 
for the Neuberger fermion, and as 0.7exp(— 2 • \x — y\) for the overlap HF. 



26 



0.01 



a 0.0001 



v 1e-06 



1e-08 



1e-10 



Neuberger fermion, n=1 .64 i — h- 
Overlap hypercube fermion, n=1 .3 : 



m + 
a? 

St! 



10 



15 



20 



25 



Figure 15: Comparison of the degree of locality for the Neuberger fermion and 
for the overlap HF. We see that the latter is clearly more local; the exponent 
differs by more than a factor of 2. 

On the technical side we mention that the convergence of / m ax( r ) at long 
distances requires a very precise evaluation of the inverse square root. This 
motivated the use of / max ( r = 24) as a measure for the precision of the 
approximations to the overlap formula, see Section 5. 

7 Approximate rotation invariance 

As a last comparison between the Neuberger fermion and the overlap HF, 
we consider the extent of the violations of rotation invariance. To quantify 
this property, we introduce the function f max (\x — y\), which corresponds to 
definition ([16]) in Euclidean metrics, as well as the corresponding function 
fwm(\x-y\), 



fm^(\x-y\) 



max f 



l[E(^-^) s 



1/2 



\x 
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/mind a; - y\) 



ty { W(v)\\ I [E< 



1/2 



-y\}- (17) 



We put a unit source on one site x and probe all other sites y for a fixed 
Euclidean distance. For each distance \x — y\ that occurs, we determine the 
difference 

(/maxO - V\) - fmin(\x ~ y\)) 



which represents a measure for the violation of rotation invariance. Fig. (IE 
shows that this difference decays exponentially as a function of the Euclidean 
distance. (Again we present data from a periodic 12 4 lattice at (3 = 6.) 
For the Neuberger fermion (at fi = 1.64), the asymptotic decay amounts 
to 0.065 exp(— 0.11 • \x — y\). The corresponding asymptotic decay for the 
overlap HF (at [i = 1.3) behaves as exp(— 2.10 • \x — y\). 



0.01 



5 0.0001 



1e-06 



1e-08 



1e-10 



Neuberger fermion i — h- 
Overlap hypercube fermion : 




6 

[ x- y | 



10 



12 



Figure 16: Comparison of the violations of the rotation invariance for the 
Neuberger fermion and for the overlap-HF. We see that the latter approxi- 
mates rotation invariance much better. In both cases, the violations of rota- 
tion invariance decay exponentially with the distance, but for the overlap HF 
the exponent of the decay is almost doubled. 
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8 Conclusions 



We have constructed a HF for QCD, which approximates the GWR at /3 = 
6. It involves the couplings of the truncated perfect free fermion, along 
with four more parameters which amplify certain links and add fat links as 
well as a clover term. This HF is designed to be inserted into the overlap 
formula, which renders its chirality exact. Since we start off in the right 
vicinity, the convergence under polynomial evaluations of the overlap formula 
is accelerated compared to the standard Neuberger fermion, which uses the 
Wilson fermion as point of departure. We observed consistently over lattice 
volumes 4 4 up to 12 4 an improved convergence rate the overlap-HF, which 
allows us to obtain the same chiral accuracy as the Neuberger fermion if the 
polynomial degree is reduced by a factor > 5. This implies a correspondingly 
reduced computational effort. 

Our experience with the numeric treatment of the HF is based on a simple 
implementation [jTT]| , which first computes (hierarchically) a look-up table 
for all the "hyperlink variables" across (spatial) diagonals inside the unit 
hypercube. This increases the number of SU (3) matrices per site to be stored 
by a factor of 10 compared to the clover Wilson fermion. We further estimate 
that the computational overhead amounts to a factor pa 15. Hence after the 
gain in the convergence rate we are left with an computational overhead of 
a factor of ~ 3. 

However, we should emphasize that the algorithms for the overlap-HF are 
not optimized carefully yet, so there may still be space for a further gain due 
to a better implementation. Moreover, we expect a number of further virtues 
of the overlap-HF compared to the Neuberger fermion, which could well 
compensate this remaining overhead. We have shown that locality is clearly 
improved — the exponent of the correlation decay (in the distance between 
ip and if)) is doubled — which makes the overlap formula applicable up to 
stronger coupling. We have also shown that the overlap-HF approximates 
the rotation invariance very well, in contrast to the Neuberger fermion. 

However, the really crucial question is if our expectation for an improved 
scaling can also be confirmed. If the overlap-HF allows for the use of a some- 
what larger physical lattice spacing, then the remaining overhead could be 
more than compensated. This hope is based in particular on the elements of 
an truncated perfect actions, which are incorporated in our HF construction. 
Indeed, a strongly improved scaling was observed for free fermions and in the 
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2-flavor Schwinger model d = 2 [ I3"|| . We also run some tests for the meson 
dispersions in QCD, but it turned out that the dispersions are quite noisy, 
in particular for the overlap-HF. Hence we postpone the delicate question 
of scaling for further investigation. Interestingly, the phenomenon of more 
noise is also known from 0(a) improved Wilson fermions with a clover term. 
As a general property, GW fermions are also 0(a) improved. Hence it is 
conceivable that the overlap HF has an even better scaling behavior than the 
original HF, since the overlap formula removes the 0(a) artifacts. 

As long as GW fermions can only be applied in the quenched approxima- 
tion, the connection with chiral perturbation theory [EH] seems most attrac- 
tive. However, the ultimate goal is the use of dynamical Ginsparg- Wilson 
fermions, which appears as a tremendous task. Hence it is worthwhile study- 
ing the optimal access carefully. We hope that the current work contributes 
to this optimization. 
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